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Abstract — An energy-efficient approach is presented for shap- 
ing a bit-interleaved low-density parity-check (LDPC) coded 
amplitude phase-shift keying (APSK) system. A subset of the 
interleaved bits output by a binary LDPC encoder are passed 
through a nonlinear shaping encoder whose output is more 
likely to be a zero than a one. The "shaping" bits are used 
to select from among a plurality of subconstellations, while 
the unshaped bits are used to select the symbol within the 
subconstellation. Because the shaping bits are biased, symbols 
from lower-energy subconstellations are selected more frequently 
than those from higher-energy subconstellations. An iterative 
decoder shares information among the LDPC decoder, APSK 
demapper, and shaping decoder. Information rates are computed 
for a discrete set of APSK ring radii and shaping bit probabilities, 
and the optimal combination of these parameters is identified 
for the additive white Gaussian noise (AWGN) channel. With 
the assistance of extrinsic-information transfer (EXIT) charts, 
the degree distributions of the LDPC code are optimized for use 
with the shaped APSK constellation. Simulation results show that 
the combination of shaping, degree-distribution optimization, and 
iterative decoding can achieve a gain in excess of 1 dB in AWGN 
at a rate of 3 bits/symbol compared with a system that does 
not use shaping, uses an unoptimized code from the DVB-S2 
standard, and does not iterate between decoder and demodulator. 



I. Introduction 

Amplitude phase-shift keying (APSK) is a modulation con- 
sisting of several concentric rings of signals, with each ring 
containing signals that are separated by a constant phase 
offset. APSK has recently become widely adopted, due pri- 
marily to its inclusion in the second generation of the Digital 
Video Broadcasting Satellite standard, DVB-S2 [1], where it 
is combined with low-density parity-check (LDPC) coding. 
APSK offers an attractive combination of spectral and energy 
efficiency, and is especially well suited for the nonlinear 
channels typical of satellite systems. 

For a given modulation order M, an APSK constellation is 
characterized by the number of rings, the number of signals in 
each ring, the relative radii of the rings, and the phase offset of 
the rings relative to each other. In J2 and O, these parameters 
are optimized with an information-theoretic technique involv- 
ing the maximization of the symmetric information rate (SIR), 
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which is the mutual information between channel input and 
output under the assumption of equally-likely input symbols. 
The SIR is a suitable objective function when the M symbols 
are selected with uniform probability, which is the case for 
many conventional systems, such as DVB-S2. However, the 
SIR is not necessarily equal to the capacity of the channel, 
which is the information rate optimized over all possible 
input distributions. Achieving the capacity of APSK requires 
a nonuniform distribution of symbols. In 0, a nonuniform 
symbol distribution is considered and optimal symbol proba- 
bilities are determined by maximizing the mutual information. 
However, [3] does not specify how these distributions can be 
achieved in a practical system. 

While it is difficult to induce an arbitrary symbol distri- 
bution, constellation shaping techniques, such as the ones 
proposed in J4|, can be used to bias the symbols and increase 
the information rate. Constellation shaping was a topic of 
intense research activity in the 1980's and early 1990's. Much 
of this early research activity was directed towards wireline 
modems, culminating in the V.34 modem standard [5). For a 
comprehensive survey of the developments leading up to the 
V.34 standard, the reader is directed to [6| and the references 
therein. However, because wireline modems support much 
larger constellations than wireless systems and because this 
early work predates modern capacity-approaching codes, most 
of this early work is not readily applicable to the problem of 
shaping small APSK constellations for use with LDPC codes. 

Recent work on constellation shaping has focused on its 
application to the smaller constellations typical of wireless 
and satellite systems. One modern solution is to match a 
variable-rate turbo code with a Huffman code to allow a 
nonequiprobable mapping of symbols [7|. A similar approach 
has been proposed that uses variable-rate LDPC codes 10. 
However, the requirement of a variable-rate encoder makes 
such solutions inconvenient. Another solution is to use coset 
coding to combine turbo coding with quadrature amplitude 
modulation (QAM) [9|. An approach that involves a binary- 
input-ternary-output turbo code and hexagonal modulation is 
proposed in iflOll . 

In ifTTl and lfT2l . Le Goff et al. propose a shaping tech- 
nique suitable for bit-interleaved systems typical of modern 
communication standards. The technique involves the use of 
the short nonlinear shaping codes described in |4| to select 
from among a plurality of subconstellations. The constellation 
is partitioned into subconstellations such that subconstellations 
with lower average energy are selected more frequently than 
constellations with higher energy. However, the results in 
IfTTl are limited to convolutionally coded 16-QAM, while 
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Fig. 1. System model. The upper portion is the transmitter block diagram, while the lower portion is the receiver block diagram. 



the results in ITS! are limited to turbo-coded PAM. Neither 
reference considers APSK, which, as shown in this paper, is 
particularly well suited for constellation shaping. 

In the first part of this paper, we build upon the information- 
theoretic optimization of APSK presented in lEH. Il3l and 
the practical shaping techniques proposed in ifTTl . |[T2| . By 
computing the information rates of shaped constellations, we 
jointly optimize parameters used by both the APSK constella- 
tion and the shaping code. To provide compatibility with DVB- 
S2, we limit the APSK constellation parameters to the finite set 
of parameters found in the DVB-S2 standard. Furthermore, we 
consider only short shaping codes in order to limit complexity. 

We then turn our attention to the implementation of an ac- 
tual system that uses both shaping and LDPC codes. Initially, 
we limit the choice of LDPC codes to be just the codes in 
the DVB-S2 standard. We then use the concept of extrinsic- 
information transfer (EXIT) charts |[T3l . |[T4l to optimize the 
LDPC code's degree distributions with respect to the shaped 
APSK modulation. Our bit-error rate (BER) results show that, 
at a rate of 3 bits/symbol and using 32-APSK modulation, 
the proposed system with shaping achieves a gain of 1.13 
dB relative to a more conventional DVB-S2 system. The 
gain is due to three factors: (1) use of an iterative receiver, 
which iterates between decoder and demodulator (i.e., a so- 
called BICM-ID receiver fl3), ESI), (2) shaping, and (3) a 
redesign of the LDPC code to account for the shaped-APSK 
modulation. The relative gain due to each of these three factors 
is 0.33 dB, 0.46 dB, and 0.34 dB, respectively. 

The remainder of this paper is organized as follows. A 
model for bit-interleaved coded APSK with constellation 
shaping is given in Section [TT] The section describes the 
operation of the nonlinear shaping encoder and the receiver. In 
Section Hill constellation shaping strategies are proposed for 
APSK constellations with mappings that are equivalent (up 
to a bitwise complement) to those in the DVB-S2 standard. 
In Section [IV] the shaping and modulation parameters are 
selected from a finite set in order to maximize the information 
rate. The implementation of an actual LDPC-coded system 
is presented in Section [V] where EXIT-based techniques are 
used to optimize the LDPC degree distributions. Finally, 
conclusions are given in Section [VI] 



II. System Model 

A. Transmitter 

A block diagram of the transmitter is shown in the upper 
part of Fig. Q] The system input is a length-i^ c vector b of 
equally-likely information bits, which is encoded by a rate 
R c = K c /N c binary LDPC encoder. The length- N c codeword 
u at the output of the LDPC encoder is permuted by a 
interleaver 111 to generate the interleaved codeword v. A bit 
separator arbitrarily separates v into two disjoint groups, d 
of length K s and S2 of length N c — K s . The vector d is 
segmented into L short blocks of length fc s , where K s = Lk s , 
and passed through a rate R s = k s /n s shaping encoder. The 
shaping encoder produces zeros more frequently than ones. 
The L short length-n s blocks at the output of the shaping 
encoder are concatenated to produce the length- N s vector c, 
where N s = Ln s . The vector c is then permuted by a second 
interleaver II2 to produce si. 

The vectors Si and S2 are together used by the APSK 
modulator to select symbols from the constellation X = 
{x%, xj\f}, where each constellation symbol is a complex 
scalar. The elements of Si are said to be shaping bits, because 
they are used to select from among several subconstellations 
with a nonuniform probability. This is in contrast with the 
unshaped bits 82, which select symbols from the selected 
subconstellation with uniform likelihood. Let g < m be the 
number of shaping bits per symbol, where m = log 2 (M). It 
follows that X is partitioned into 2 9 subconstellations, each of 
size 2 m ~ 9 . The variables g and m are related to the lengths 
of si and s 2 by N s / (N c - K s ) = g/ (m - g). 

Each symbol is selected from X according to the prescribed 
symbol mapping by using g bits from Si to select the sub- 
constellation and m — g bits from s 2 to select the symbol 
within the subconstellation. The symbols in X are normalized 
to have average energy £ s = 'Yl li=1 p{xi)\xi\ 2 , where p{xi) is 
the probability that Xi is selected. In Fig. Q] the two bitstreams 
are combined to form the vector z, where each group of m 
consecutive bits consists of g bits from Si and m — g bits 
from s 2 . The APSK modulator uses this input and the symbol 
labeling map to produce a vector x of coded symbols of length 
N = (N c - K s + N s )/m. 
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Let po denote the probability that a particular bit in si (or, 
equivalently, c) is equal to zero, and p\ be the probability 
that it is equal to one. The purpose of the shaping encoder 
is to produce an output with a particular po > 1/2. The 
codebook is constructed such that it contains the 2 ks distinct 
n s -tuples of lowest possible Hamming weight. Construction 
is a recursive process (4), with C initialized to contain the 
all-zeros codeword of length n s . Codewords of higher weight 
are recursively added to C until \C\ = 2 ka . Suppose that C 
contains all codewords of weight w — 1 or lower. During the 
next recursion, weight- w codewords are repetitively drawn and 
added to C until either the number of distinct codeword in C is 
2 ka or all weight-to codewords have been used. In the former 
case, the code construction is complete, while in the latter case 
it moves on to begin adding codewords of weight w + 1. 

As an example, consider the (n s ,k s ) = (9,7) code. The 
number of codewords in S is 2 7 = 128. There are 

GMMK) ■ ■« 

binary 9-tuples of weight three or less. It follows that S is 
a subset of these 9-tuples. In particular, S will contain all 9- 
tuples of weight-2 or less, and will contain 38 of the weight- 
3 9-tuples. It is preferable to select the weight-3 codewords 
in such a way that the column weights of matrix containing 
the codewords of C are balanced, thereby ensuring that po is 
approximately the same in every bit position. 

The overall rate of the system, R = K c /N, is the number 
of information bits per modulated symbol, and is related to 
the rates of the LDPC and shaping codes by: 

R = R c [m + g(R s -l)}. (1) 

When shaping is used, g > and R s < 1, which implies that 
for a fixed R, the rate of the LDPC code R c with shaping 
must be higher than the rate of the LDPC code when shaping 
is not used. 

B. Receiver 

The sequence of coded APSK symbols x is transmitted 
over an additive white Gaussian noise (AWGN) channel. After 
matched-filtering, the channel output corresponding to the k th 
signaling interval is represented by the complex scalar 

Vk = %k + n k (2) 

where x k is the k th element of x, and the {n^} are indepen- 
dent and identically distributed (i.i.d.) zero-mean circularly- 
symmetric complex Gaussian variables with power Nq, which 
is the one-sided noise-spectral density. The {yk} correspond- 
ing to the received codeword are collected into the vector y. 

The receiver, which is shown in the lower part of Fig. Q] 
processes the received vector y and produces an estimate b of 
the data bits. The receiver uses the principle of bit-interleaved 
coded modulation with iterative decoding (BICM-ID) [ 15 1, but 
includes an additional stage of processing required to decode 
the shaping code. The main components of the receiver are 
an APSK demodulator/demapper, a shaping decoder, and an 
LDPC decoder, each of which are implemented using the soft- 
input soft-output algorithm described below and connected 



by appropriate interleaves and deinterleavers. The BICM-ID 
receiver iterates among the demodulator, shaping decoder and 
LDPC decoder. The bit information exchanged in the receiver 
is represented as log-likelihood ratios (LLRs), following the 
convention that an LLR is the log of the probability of zero 
divided by probability of one. The LDPC decoder consists 
of a variable-node decoder (VND), an edge interleaver II3, 
and a check-node decoder (CND) lfl4l . We use L a to denote 
the a priori input to a module (which may possibly be fed 
back from another module), and use L e to denote the output 
extrinsic information. 

The received symbols y are passed into the APSK de- 
modulator, which demaps the symbols and produces a vector 
L e (z) containing the LLRs of the binary codeword z. The 
operation of the demodulator is described below in Section 
III-B1I By reversing the process used by the transmitter to 
combine si and S2, the LLR vector is separated by a serial- 
to-parallel (S/P) converter into two sequences, L e (si) an d 
L e (s2). The sequence L e (si), which represents the LLRs of 
the shaping bits s\, is de-interleaved to produce L a (c) and fed 
into the shaping decoder. The shaping decoder generates the 
a posteriori LLR L e (d) by using both L a (c) and the a priori 
input information L a (d), which is initially set to all zeros, but 
as described below, will contain information fed back from 
the LDPC decoder in subsequent iterations. The operation of 
the shaping decoder is described below in Section III-B2I A 
parallel to serial (P/S) converter combines the output of the 
shaping decoder L e (d) with L e (s2) to produce a vector L e (v), 
which represents the LLRs of vector v. The vector L e (v) 
is de-interleaved by IT^ 1 and the resulting vector L a (u) is 
introduced as the input to the LDPC decoder, which performs 
an iteration of standard sum-product decoding. 

The output of the LDPC decoder is fed back to the shap- 
ing decoder and APSK demodulator to be used as extrinsic 
information during the next iteration. The output L e (u) of the 
LDPC decoder is interleaved by ITi. The interleaved sequence 
L a (v) is split by a S/P converter into two sequences, L a (d) 
and L a (s2). The vector L a (d), which represents the a priori 
information on d, is passed into the shaping decoder, where 
after the initial iteration it is used as the a priori input. Another 
P/S converter, which is identical to the bit-combination block 
in the transmitter, combines the interleaved output of the 
shaping decoder L a (si) with the LLRs of the unshaped bits 
X a (s2) to create the vector L a (z), which is used as the a 
priori input to the demapper after the initial iteration (prior to 
the first iteration, L a (z) is initialized according to the average 
bit probabilities for each bit position, per Section Hi-Bil l. 

1 ) The Demodulator: The demodulator is implemented on 
a symbol-by-symbol basis. For ease of exposition, we drop 
the dependence on the symbol interval in this subsection, so 
that symbols may be expressed without subscripts. During 
a particular symbol interval, the demodulator computes the 
LLRs L e (z) of the m code bits associated with the symbol. 
The inputs to the demodulator are the received complex 
symbol y, which is produced by a matched-filter front end, as 
well as the set of m a priori LLRs L a (z), which is extrinsic 
information generated by the shaping and LDPC decoders 
during the previous iteration. Prior to the first iteration, L a (z) 
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is initialized to L a (si) = log( ^ ) for the shaped bits and 
L a (s<i) — for the unshaped bits. 

Let the function fa (x) return the k th bit that labels symbol 
x. Using the MAP demodulator described in [12] and flTl . 
the a posteriori probability that fk(x) = q,q £ {0, 1}, is 



P(fk(x)=q\y) 



e n - 



71=1 



+ e L a (z n ) 



(3) 



where X£ is the subset of X containing those signals whose 
k th bit position is labeled with q. For the complex AWGN 
channel, the conditional probability of y given x is 



p(y\o 



l 



i 



— exp<-_|y-xr 



(4) 



Expressing the output as an LLR, substituting (0 and 
and canceling factors common to the numerator and 
denominator gives the output LLR 



L e (z k ) = In 



P(/ fc (a:)=(%) 
P{fa[x) = l\y) 



In 



E exp< 
x>ex° 


n±k J 


E ex P < 


Jy_^l + j2 fn{x/)La(Zn) > 

n=l 



(5) 



The above computation may be efficiently computed using the 
max-star operator ifTTll . 

2) The Shaping Decoder: The first LLR stream L e ( s i) is 
de-interleaved and fed into the shaping decoder as L a (c). 
The shaping decoder outputs the extrinsic LLRs L e (d) and 
L e (c) based on the input from the demodulator and the 
extrinsic information fed back from the LDPC decoder. The 
implementation of the shaping decoder is similar to that of the 
demodulator, but the summations are now over subsets of the 
shaping code rather than subsets of the signal constellation. 
The log-likelihood of each codeword is found by taking 
the inner product of the n s bit LLRs with each candidate 
codeword. Taking into account these differences, the output 
of the MAP decoder for the shaping code is 

^ n B k B \ 

;/ n (d')£ a (c„) + £ d ^(*) 
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e^k 



J 

(6) 



where V q k denotes the set of messages d whose k th bit position 
is labeled with q, q £ {0, 1}, and /„(d') is the n th bit in the 
codeword associated with message d'. 

The extrinsic information L e (c) produced by the shaping 
decoder can be implemented in a similar manner: 
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where denotes the shaping codewords c whose k th bit 
position is labeled with q, q E {0,1}, and f n (c') is the 
n th bit in the message associated with codeword c'. Note 
that, because there are more zeros than ones in the shaping 
codewords, |C"| > \C].\, and therefore there are more terms 
in the numerator of (0 than in the denominator. This is in 
contrast with ©, which has the same number of terms in the 
numerator and denominator since the message bits are equally 
likely to be or 1. 

III. Shaping Strategies 

A. Shaping for 16-APSK 

Consider the 16-APSK constellation, which uses the two 
innermost rings shown in Fig. [2] The inner ring contains 
4 symbols, while the outer ring contains 12 symbols. The 
bit mapping is as indicated on the figure. Note that this 
constellation is identical to the 16-APSK constellation in the 
DVB-S2 standard [ 1 1 and the mapping is identical except that 
the first two bits are complemented. The ratio of the radius 
of the outer ring to the radius of the inner ring is denoted 7, 
which according to the DVB-S2 standard may assume a value 
from the set {2.57, 2.60, 2.70, 2.75, 2.85, 3.15}. 

The minimum size partition in the shaping scheme should 
be equal to the number of minimum-energy signals, which in 
this case is four. It follows that the number of shaping bits 
may be either g = 1 or g = 2. When g = 1, the shaping bit 
is the first bit of the four-bit word labeling the constellation, 
and the constellation is partitioned into two subconstellations. 
The first subconstellation contains symbols labeled A and B\, 
while the second subconstellation contains symbols labeled B2 
and C\. The first set is selected with probability po, while the 
second set is selected with probability pi. 

When g = 2, the shaping bits are the first two bits of the 
word. The signal set is partitioned into the four subconstella- 
tions indicated in Fig. i.e., A, B\, B2, and C\. Set A is 
selected with probability p\, set B\ and B2 are each selected 
with probability popi, and set C\ is selected with probability 
p\. As po > pi, it follows that signals in set A are selected 
most often, and signals in set Ci are selected least often. 

B. Shaping for 32-APSK 

If all the symbols shown in Fig. are used, then the 
modulation is 32-APSK. The constellation consists of three 
concentric rings, with 4 symbols in the inner ring, 12 symbols 
in the middle ring, and 16 symbols in the outer ring. This 
constellation is identical to the 32-APSK constellation in the 
DVB-S2 standard H) and the mapping is identical except that 
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Fig. 2. APSK constellations. 16-APSK uses the signals shown in the two 
innermost rings with symbol mappings indicated in parenthesis. 32-APSK 
uses all of the shown signals. 

the first and last bits are complemented. The ratio of the 
radius of the middle ring to the radius of the inner ring is 
denoted 71, while the ratio of the radius of the outer ring to 
the radius of the inner ring is denoted 72. According to the 
standard, the value of 7 = {71,72} must be one of the fol- 
lowing: {2.53,4.30}, {2.54,4.33}, {2.64,4.64}, {2.72,4.87}, 
or {2.84,5.27}. 

As with 16-APSK, the number of minimum energy signals 
is four, which is the size of the smallest partition. The number 
of shaping bits should be no more than three. When g = 1, 
the shaping bit is the second bit of the word. This divides the 
constellation into two partitions. The first partition, which is 
selected with probability po, contains the signals in the first 
two rings (i.e., sets A, B\, B2, and Ci). The second partition, 
which is selected with probability p\, contains the signals in 
the outer ring (i.e., sets B3, C2, C3, and D). With this scheme, 
signals in the inner two rings are more likely to be selected 
than signals in the outer ring. 

When g = 2, the shaping bits are the second and last bits of 
the word. The signal set is partitioned into four sets: {A, Bi}, 
{B 2 ,C7i}, {S 3 ,C7 2 }, and {C 3 ,D}. The partition {A,Bx} is 
selected with probability p$, making it most likely to be 
selected. The partition {C3, D} is selected with probability p\, 
making it least likely to be selected. The other two partitions 
are each selected with probability pop\. 

When g = 3, the shaping bits are the first, second, and last 
bits of the word. The signal set is partitioned into the eight 
sets indicated in Fig. [2 i.e., A, B\, B 2 , B 3 , C\, C2, C3, and 
D. Partition A is most likely and is selected with probability 
Pq. The Bk partitions are selected with probability p\p\, while 
the Ck partitions are selected with probability pop\. Finally, 
the D partition is selected with probability p\, which makes 
it least likely to be selected. 

IV. Joint Parameter Optimization 

With the shaping techniques described in the previous 
section, energy may be conserved by using larger values of 
Pq. However, larger values of po generally require lower R s . 



To maintain a fixed overall rate R, using a lower R s requires 
a larger value of R c , which weakens the effectiveness of the 
LDPC code. Clearly there is a tradeoff between R s and R c . To 
determine the optimal tradeoff, we turn to information theory. 

A. Optimization with Respect to CM Capacity 

In performing the optimization, we wish to determine 
achievable information rates by computing the mutual infor- 
mation between the channel input and output. On the one 
hand, we could compute the mutual information between the 
modulated symbols at the channel input and the corresponding 
output. This is called the coded modulation (CM) capacity 
in lfT8l . On the other hand, we could compute the mutual 
information between the bits at the input of the modulator 
and output of the demodulator. In the case of a shaped 
system, the modulator would include the shaping encoder. 
This type of capacity is called the BICM capacity in |18|. 
The distinction between the two is that the BICM capacity is 
relevant for a system that uses a BICM receiver; i.e., doesn't 
feed back information from decoder to demodulator. However, 
our system uses a BICM-ID receiver and is therefore able to 
outperform a BICM receiver and achieve performance close 
to that of CM. Therefore, we will perform the optimization 
with respect to the CM capacity. 

Let Y be the output of an AWGN channel with complex 
scalar input X 6 X. The capacity of the channel is [19| 



C = m&xI(X;Y) 

p(x) 



(8) 



where p{x) is the probability mass function (pmf) of X, the 
information rate (also called average mutual information) is 



and 



I(X;Y) = E[i(X;Y)} 



■ 1 x , p(x,y) . p(y\x) 

1 (x; y) = log = log ■ 



(9) 



(10) 



' p{x)p(y) p(y) 

Note that the expectation in (0 is with respect to the joint pdf 
p(x,y). When a base-2 logarithm is used, then ^ has units 
of bits per channel use (bpcu). 

From the theorem on total probability, 



p(y) 

Substituting (TTll into (TITJb gives 



^2 p(y\ x ')p( x ')- 

x'ex 



(11) 



i{x;y) 



\ogp(y\x) ~ log p(v\ x ')p( x ') -(12) 



\x'ex 



where p{y\x) is given by for the AWGN channel. 

The capacity is found by maximizing the information 
rate with respect to the distribution of X. In general, the 
optimization requires that the probability of occurrence of 
each of the symbols be independently varied. Techniques for 
finding the optimizing distributions may be readily found in 
the literature, such as the Blahut-Arimoto algorithm |20|. 
However, the shaping scheme proposed in Section [Til] does 
not allow for independent symbol probabilities. Rather, the 
symbol probabilities are related by the value of po, the number 
of shaping bits, and the symbol labeling. 
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Fig. 3. Information rate (in bpcu) of 16-APSK and 32-APSK over an AWGN 
channel. The number of shaping bits used is indicated. The solid line on the 
right of each group corresponds to a uniform input distribution (no shaping). 
The rates are maximized over the permissible ring-radius ratios. 

Rather than allowing arbitrary p(x), we optimize our system 
under the constraint of the shaping techniques described in 
SectionU!!] For a particular constellation X, number of shaping 
bits g, and (n s ,k s ) shaping code, we numerically evaluate 
the information rate I(X; Y). While the evaluation could be 
done using a Monte Carlo integration [18], we use the Gauss- 
Hermite quadratures method of evaluation described in ||2TI . 

We begin by evaluating the information rate under the 
assumption that the input symbols have a uniform distribution; 
i.e., the symmetric information rate. For each M, we limit the 
ring-radii of the APSK constellation to be chosen from among 
the values specified in the DVB-S2 standard. We further limit 
the inter-ring phase offsets to be the same as in DVB-S2, as 
the phase offsets have been shown to have a negligible effect 
on the information rate [2|. The symmetric information rate, 
maximized over the permissible ring-radius ratios, is shown 
for 16-APSK and 32-APSK in Fig. [3] 

Next, we compute information rates when shaping is used. 
For each constellation X, we consider all shaping codes with 
n s < 20 and k s < 10. Larger values are not considered due 
to their high decoding complexity (which is exponential in 
k s ). For each shaping code, we determine the corresponding 
value of po. For a given number of shaping bits g and the 
shaping strategy given by Section [TTTJ we determine the pmf 
p(x) and compute the corresponding information rate. The 
total number of distinct po that we consider is 121, and we 
repeat this exercise for each permissible value of 7. For each 
M and g, we determine the maximum information rate at each 
value of £ s /Nq, where the rate is maximized over both the 
permissible po and 7. 

Based on our search, we have found that shaping gains of 
up to 0.32 dB are achievable. Fig. [3] shows the achievable in- 
formation rates with shaping for 16- and 32-APSK in AWGN, 
optimized over the shaping code and ring-radius ratios. The 
shaping gain, which is the dB difference between the capacity 
curves of the uniform and shaped systems, is shown in Fig. 
|4] for different values of g. The results for 16-APSK show 
the benefit of using two shaping bits over just one shaping 



0.35 




0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 



Information Rate 

Fig. 4. Shaping gain (in dB) of 16-APSK and 32-APSK in AWGN as a 
function of the overall information rate R (in bits/symbol). 

TABLE I 

Minimum required £ b /No in AWGN for M-APSK with g shaping 

BITS. THE OPTIMAL po AND 7 ARE SHOWN. 



M 


9 


R 


£ b /N 


gain 


Po 


7 


16 


1 


3.09 


4.714 dB 


0.091 dB 


0.623 


2.70 


2 


2.95 


4.077 dB 


0.322 dB 


0.688 


2.57 




1 


3.88 


5.915 dB 


0.265 dB 


0.716 


{2.64,4.64} 


32 


2 


4.06 


6.517 dB 


0.175 dB 


0.623 


{2.53,4.30} 




3 


3.89 


5.898 dB 


0.310 dB 


0.656 


{2.53,4.30} 



bit. From these curves, it is clear that if shaping is used at 
all, then it is advisable to use two shaping bits. The results for 
32-APSK show that while three shaping bits is better than just 
one, using two shaping bits is actually worse than one. This is 
because using just one shaping bit segments the constellation 
into two natural subconstellations: One containing the two 
innermost rings and the other containing just the outer ring. 
On the other hand, two shaping bits creates an awkward 
partitioning that results in symbols in the middle ring being 
picked with different probabilities. Thus, it is not advisable to 
use two shaping bits in the 32-APSK case. Furthermore, the 
incremental gain of using three shaping bits over one shaping 
bit is negligible (see the leftmost pair of curves in Fig. [3), and 
it is recommended that 32-APSK systems use just one shaping 
bit. This result is intuitive: with one shaping bit, the system 
will simply choose from the outer ring or the inner two rings. 

The results from the optimization are tabulated in Table U 
For each value of M and g, the table shows the rate R for 
which shaping has the highest gain, along with the values 
of po and 7 that achieve the gain. The value of £b/No for 
which the shaped constellation achieves rate R is listed, where 
£b/N = (1/R)(S S /N ) is the SNR per information bit. 

Fig. [5] shows the value of po that maximizes the information 
rate as a function of SNR. The optimal po decreases as the 
SNR increases, reaching a floor at po = 0.6230. This minimum 
value is a consequence of the constraints on n s and k s and 
corresponds to a (n s ,k s ) = (11,10) shaping code. While a 
(k s + l,k s ) code with larger k s could be used to obtain a 
lower po, the resulting shaping gain will be too small to merit 
the complexity of such a large code. 
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Fig. 5. The value of po that maximizes the information rate for nonuniform 
APSK in AWGN. For 16-APSK, 1 and 2 shaping bits are used, while for 
32-APSK, 1 through 3 shaping bits are used. 
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Fig. 6. Peak-to-average power ratio (PAPR) of nonuniform 16-APSK and 
32-APSK as a function of po and the number of shaping bits. For 16-APSK, 
7 = 2.57, and for 32-APSK, 7 = {2.64, 4.64}. 

B. PAPR Constraints 

The peak-to-average power ratio (PAPR) is a consideration 
when nonlinear power amplifiers must be used. The PAPR is 
defined as 



PAPR 



max |M 2 ) 

x£X U ' 

EM 2 ] 



max |M 2 ) 



x£X 



p(x)\x\ 



(13) 



For a fixed average energy, shaping spreads the signals further 
apart, which increases the numerator of ( f]~3b . It follows that 
shaping will increase the PAPR for a particular constellation 
X. This behavior can be seen in Fig. [6] which shows the 
relationship between PAPR and po for the shaping strategies 
considered in this paper. 

At first glance, it would seem that shaping will dramatically 
increase the PAPR. This is true if we were to use the same X 
both with and without shaping. However, when we performed 
our optimization, we let the ratio of ring radii vary among 



their permissible values. When we did this, we found that the 
optimal ratio of ring radii were always smaller with shaping 
than without. Furthermore, the PAPR is smaller for smaller 
ring-radii ratios. Thus, in achieving the balance between the 
ratio of ring radii and the value of po, the PAPR increase is 
not significant. For example, in the case of 16-APSK with 2 
shaping bits, we found that the PAPR with shaping is 1 .98 dB, 
while for the uniform case, which was optimized at the largest 
value of 7, was 1.11 dB, and thus only 0.86 dB additional 
PAPR is required when shaping. 

V. Results with LDPC Coding 

In this section, we consider the AWGN bit-error perfor- 
mance of a system with the proposed shaping technique 
when an actual LDPC code is used. As a running illustrative 
example, we consider a system that uses 32-APSK modulation 
operating at a coded rate of 3 bits/symbol. Initially, we pair 
the shaped system with a code from the DVB-S2 standard 
(T). However, the DVB-S2 codes were not optimized for use 
with the proposed shaping technique. Thus, we next optimize 
the codes by using EXIT-based techniques similar to those 
proposed in lfl4l . and show that the optimization provides 
an additional performance improvement. Finally, we make 
some observations on the complexity of the overall system, 
as compared with a uniform (non-shaped) system. 

A. Standardized DVB-S2 Codes 

We begin by comparing the performance of shaped and 
uniform systems that use LDPC codes from the DVB-S2 
standard [1|. The APSK ring radii are selected to maximize 
the corresponding CM capacity, as described in Section [TV] 
For both systems, the overall rate is R = 3 bits/symbol. To 
achieve this rate, the uniform system must use a rate R c = 3/5 
LDPC code, and so the (N C ,K C ) = (64 800,38 880) code 
from the DVB-S2 standard is adopted. The shaped system 
uses g = 1 shaping bit per symbol, which was found in 
Section [IV] to offer high shaping gain while avoiding the 
system complexity of using multiple shaping bits per symbol. 
A rate R s — 1/2 shaping code is used with (n s , k s ) = (4, 2), 
which provides a reasonably good po while still allowing the 
standardized LDPC code rates R c from the DVB-S2 standard 
to be used. From (Q~|i, when g = 1 and R s = 1/2, the 
LDPC code rate must be R c = 2/3 for the overall rate to 
remain R = 3 bits/symbol, and so the shaped system uses the 
(JV C , K c ) = (64 800, 43 200) code from the DVB-S2 standard. 

The parameters used in the simulations are listed in Table iLTl 
The column marked 7 shows the ring-radii ratios used for both 
the uniform and shaped systems, while the LDPC and shaping 
code rates are listed in the columns marked R c and R s , 
respectively (with R s = 1 indicating cases when no shaping 
is used). For each system, we consider the performance of 
several LDPC codes. For this subsection, only those LDPC 
codes in the rows marked "standard" in the "LDPC code" 
column are of interest. The column marked "information- 
theoretic minimum" Eb/N^ lists the CM-capacity bound; i.e., 
the minimum £f, /No for which the CM capacity is equal to R 
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TABLE II 

Parameters used for the 32-APSK simulation. 
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LDPC 
code 


Rc 


Rs 


information- 
theoretic 

£b/N 


£ b /N 
at BER 
10~ 5 


Uniform 


{2.64,4.64} 


standard 


38880/64800 


1 


4.029 dB 


5.42 dB 


optimized 


38880/64800 


1 


4.029 dB 


5.28 dB 


Shaped 


{2.64,4.64} 


standard 


43200/64800 


2/4 


3.829 dB 


4.96 dB 


optimized 


43200/64800 


2/4 


3.829 dB 


4.80 dB 


optimized 


41661/64806 


2/3 


3.789 dB 


4.62 dB 
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Fig. 7. Bit-error rate of 32-APSK in AWGN at rate R = 3 bits/symbol. 
The solid lines are for the uniform system; from right to left, the curves show 
BICM system, BICM-ID system and BICM-ID system using optimized LDPC 
code. The dashed lines are for the shaped system with different combinations 
of LDPC codes and shaping codes. 

(see Section [IV] for a discussion about how the CM capacity 
is computed). 

The uniform and shaped systems were simulated over an 
AWGN channel with 100 iterations of decoding. For the 
uniform system, two receiver implementations are considered. 
The first implementation is a BICM receiver, which passes the 
bit-likelihoods produced by the APSK demodulator into the 
LDPC decoder without any feedback from the decoder to the 
demodulator. This is typical of most standard implementations 
of DVB-S2 receivers. The second implementation is a BICM- 
ID receiver, which feeds back soft information from the LDPC 
decoder back to the APSK demodulator to allow it to refine 
its bit-likelihoods [16]. The shaped system uses the iterative 
receiver described in Section IH-BI which involves the itera- 
tive exchange of information among the APSK demodulator, 
shaping decoder, and LDPC decoder. 

Fig.[7]shows the simulated bit-error rates of: (1) the uniform 
system with the rate R c = 3/5 DVB-S2 standardized LDPC 
code and BICM reception (rightmost curve); (2) the same 
uniform system but with BICM-ID reception (second curve 
from right); and (3) the shaped system using the R c = 2/3 
DVB-S2 standardized LDPC code and iterative receiver (third 
curve from left). In addition, the last column of Table [II] 
shows the £b/No required for the simulated BER to equal 
10 -5 with iterative decoding, which is 5.42 dB and 4.96 
dB for the uniform and shaped system, respectively. From 
Fig. [7] it is observed that the uniform system requires an 
£b/No equal to 5.75 dB to achieve a BER equal to 10 -5 



when the BICM receiver is used. Thus, the shaped system 
achieves a gain of 0.79 dB relative to the uniform system 
with BICM reception. Of this gain, 0.33 dB can be attributed 
to iterative demodulation and decoding (i.e., using a a BICM- 
ID receiver) while the remaining 0.46 dB can be attributed 
to shaping and the use of the shaping code. Notice that this 
gain is actually higher than the 0.2 dB shaping gain predicted 
from the information theory (see the difference between the 
corresponding entries in Table HH. Loosely speaking, this extra 
gain is due to an additional coding gain that arises when 
using the shaping code in an iterative receiver. More precisely, 
and based on our EXIT-based analysis (discussed in the next 
subsection), we have observed that the inclusion of the shaping 
code in the system has the additional benefit of increasing the 
slope of the variable-node EXIT curve, thereby driving down 
the convergence threshold. 

B. Code Optimization for Uniform Modulation 

As described in E3, the LDPC codes used by DVB- 
S2 were designed for efficient memory access. They were 
not optimized with respect to the APSK modulation, and 
were certainly not designed with shaping in mind. Both the 
APSK modulation and shaping may be taken into account by 
using the EXIT-based technique of fl4l . which will provide 
optimized degree distributions for the code. In this and the next 
subsection, we describe how we have optimized the LDPC 
code's degree distributions and demonstrate the resulting BER 
improvement. 

We begin by discussing how the EXIT-based technique of 
|[l4l can be used to optimize the LDPC code used by the 
uniform APSK-modulated system. Our goal here is to produce 
a redesigned LDPC code that has the same complexity as 
the code specified by DVB-S2, yet offers improved BER 
performance when used with APSK modulation. Like the 
DVB-S2 code, we limit the LDPC code to be an extended 
irregular repeat accumulate (eIRA) code ll23l . which facilitates 
systematic encoding. The parity-check matrix of such a code 
contains a dual-diagonal matrix as its last n — k columns and 
an arbitrary matrix for its first k columns. We further limit 
the code to be check regular, and require every check node to 
have degree equal to the constant check-node degree d c used 
by the DVB-S2 standardized code of the same rate and length. 
For instance, in the case of the (N C ,K C ) = (64 800,38 880) 
code, the check-node degree is a constant value of d c = 11. 

Since the check-node degree is fixed, the degree optimiza- 
tion problem is to select the optimal variable-node degrees. 
We adopt the notation of |[T4| and allow D different variable- 
node degrees d v ^, i = 1, D. The fraction of nodes having 
degree d v .i is denoted a*, while the fraction of edges incident 
to variable nodes of degree d Vt i is denoted As in [14] and 
the DVB-S2 standard, we limit the number of distinct variable- 
node degrees to D = 3, starting with d Vt % — 2. We set a 
fraction ai — (n — k)/n of nodes to be of degree d v> i = 2, 
corresponding to the dual-diagonal part of the H matrixQ- We 

*In DVB-S2, there is a single degree-1 variable node corresponding to the 
last column of the H matrix. While our code design maintains this degree-1 
node, we do not explicitly list the fraction of nodes connected to degree-1 
nodes since it will approach zero for large N c . 
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then pick a pair {d v< 2, d v ^}, where d v .2 is selected to be either 
3 or 4 and d v< 3 is an integer no greater than 25. 

The optimization requires two EXIT curves to be generated, 
one that corresponds to the variable-node decoder (VND) and 
another that corresponds to the check-node decoder (CND). 
The goal is then to fit the two curves together by picking 
appropriate variable-node degrees. The VND curve charac- 
terizes not only the variable nodes of the LDPC code, but 
also the characteristics of the modulation. The overall VND 
curve is created by first generating a transfer characteristic for 
just the modulation and its detector at the given £ s /Nq. This 
is the detector characteristic Ie,det{Ia,£s/No)> where the 
mutual information I a is computed between the demodulator's 
a priori inputs and the modulator's input bits. When 1& = 0, 
Ie,det is the BICM capacity of the modulation [ 14], [ 18 1. For 
APSK, Ie,det cannot be generated in closed form for nonzero 
I A, and thus it is generated through Monte Carlo simulation 
under the assumption that the demodulator's a priori input is 
conditionally Gaussian. The VND curve for degree-d v nodes 
is found from the detector characteristic using 

-Te.vnd (lA,d v ,£ s /No) = 

J (y {d v - l)[J- l {I A f] + [J-H^detO^M.))] 2 ) 

(14) 

where the J-function is given in [ 1 3 1 and can be computed 
using the truncated series representation of [24|. Note that (fT4l 
only gives the EXIT curve for a single variable-node degree 
d v , and therefore represents the VND curve for a regular code. 
In the case of an irregular LDPC code, the VND curve is found 
by using |[T4l 

D 

Ie,vnd ,vnd {I a, d v> i, £ s /No) . 

(15) 

The CND curve is found by using |[T4l 

/b,cnd (Ia, d c ) = 1 — J (yi c - 1 • J-\l - I A )) (16) 

where Ia is the mutual information at the input of the check 
nodes. The EXIT chart is drawn by noting that the Ie,cn~d 
produced by the CND becomes the Ia at the input to the 
VND (which we denote Ia,vnd), while the Ie.vnd produced 
by the VND becomes the I a at the input to the CND (which 
we denote Ia,cnd)- The chart plots the VND and CND curves 
with I a, vnd = ^e,cnd on the horizontal axis and Ie.vnd = 
-7a, cnd on the vertical axis. For a given degree distribution, 
VND curves are generated for several £ s /Nq and the threshold 
is determined to be the value of £ s /Nq for which the VND 
and CND just barely touch. 

By considering different combinations of {d v .2,d v ^} and 
the corresponding variable-node degree distributions that sat- 
isfy the check-node degree constraint, code designs were 
identified with low thresholds. One code found to be better 
than the standard rate R c = 3/5 DVB-S2 code when used 
with 32-APSK has the following degree distribution: 



d v ,i 


= 2 


ai 


= 0.40 bi 


= 0.182 


d v ,2 


= 4 


«2 


= 0.52 b 2 


= 0.473 


d v ,3 


= 19 


«3 


= 0.08 6 3 


= 0.345 



The uniform 32-APSK system was simulated using a code 
with this degree distribution, and the resulting BER with 
BICM-ID reception is shown in Fig. [7] (third curve from right; 
labeled "BICM-ID uniform with optimized 3/5 LDPC code"). 
This code is also listed as the second row of Table [TH which 
indicates a gain of 0.14 dB relative to the DVB-S2 standard 
code at BER 10~ 5 . 



C. Code Optimization for Shaped Modulation 

When the constellation is shaped, the shaping encoder 
(or decoder) is absorbed into the variable-node encoder (or 
decoder), and thus the contribution of the shaping code is taken 
into account by the VND curve. The system effectively uses 
a larger constellation with high dimensionality that combines 
shaping with conventional modulation. For instance, when g = 
1 and a (N s , K s ) shaping code is used, K s + (m — 1) x N s bits 
are used to select a sequence of N s symbols. Grouping these 
correlated symbols together, the resulting super-constellation 
contains 2 Ks+ ( m ~ 1 ' )Ns symbols, each represented by 2N S real 
dimensions (or N s complex dimensions). Whereas generating 
the detector characteristic for the uniform case involves in- 
dependently modulating symbols and measuring the mutual 
information between the modulator input and demodulator 
output, the characteristic of the shaped modulation must be 
generated with the shaping taken into account. In reference to 
Fig. Q] an unbiased independent bit sequence v is generated 
and passed through the pictured processing to produce the 
modulated sequence x. The received noisy symbols y are 
processed to produce the extrinsic information L e (v), and 
the detector characteristic is found by computing the mutual 
information between v and £ e (v). 

Aside from accounting for the shaping code in the detector 
characteristic, the EXIT curves are generated exactly the same 
way as for the uniform case, and the variable-degree distribu- 
tions are optimized using the same methodology. For operation 
at 3 bits/symbol, a system that uses a (N S ,K S ) — (4,2) and 
rate R c — 2/3 LDPC code was optimized, and the resulting 
degree distribution was found: 



d v ,i 


= 2 


a i 


= 0.333 h 


= 0.200 


d v ,2 


= 3 


a 2 


= 0.606 b 2 


= 0.546 


d v ,3 


= 14 


0,3 


= 0.061 b 3 


= 0.254 



Like the rate-2/3 DVB-S2 code, this code has a constant check- 
node degree of d c = 10. The VND and CND curves for this 
LDPC code with shaped 32-APSK modulation are shown in 
the EXIT chart of Fig. [8] at £ b /N a = 4.73 dB. The shaped 
system with was simulated using an LDPC code with this 
degree distribution, and the BER curve is shown in Fig. 
(second curve from left). This code is also listed as the fourth 
row of Table HU which indicates a gain of 0.16 dB compared 
with the system that uses the same shaping code along with 
the standardized rate R c = 2/3 DVB-S2 code. 
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Fig. 8. EXIT chart for BICM-ID using shaped 32-APSK and overall system 
rate R=3 in AWGN at £b/No = 4.73 dB. 

In the previous examples, the code rate R c is chosen from 
the set of code rates found in the DVB-S2 standard. However, 
due to (Q]l, this limits the set of possible shaping-code rates 
for a given R, and may result in the use of a suboptimal 
Pq. For instance, at an overall rate R — 3 bits/symbol and 
g = 1 shaping bit, then R c = 2/3 and R s = 1/2. Using the 
(n s ,k s ) — (4,2) shaping code will satisfy this requirement, 
but the resulting value of po is 0.8125, which differs from 
the optimal value of po — 0.716 identified in Section [TV] A 
(3, 2) shaping code gives a po = 0.75, which is closer to the 
optimal, but requires LDPC code rate R c — 9/14 to achieve 
an overall rate of R = 3 bits/symbol. While the standardized 
DVB-S2 codes do not support this code rate, a new code can 
be designed for this rate by using the EXIT-based techniques 
presented in this section, resulting in the degree distribution: 



d v ,i 


= 2 


ai 


= 0.357 h 


= 0.200 


d v ,2 


= 3 


a-2 


= 0.558 b 2 


= 0.469 


d v ,3 


= 14 


«.3 


= 0.085 b 3 


= 0.331 



As with the rate-2/3 codes, the code has a constant check- 
node degree of d c = 10. The VND curve for this code is also 
shown in Fig. [8] and the BER performance is shown in Fig. 
|7] (leftmost curve). The code is listed as the last row of Table 
llll which indicates a gain of 0.18 dB at BER 10" 5 compared 
with the system that uses the optimized rate R c — 2/3 shaping 
code along with a (4, 2) shaping code. 

D. Complexity Considerations 

Instead of using shaping as a way to improve energy 
efficiency, it can be used as a way to reduce the required 
receiver complexity. This feature can be seen in Fig. [9] which 
shows the average number of iterations required for the shaped 
and uniform systems to converge (i.e., correct all errors in 
a frame) in AWGN channel for the same cases whose BER 
curves were given in Fig. [7] This is a useful metric when the 
receiver operates using an automatic halting mechanism, for 
instance, if it performs syndrome-based error detection after 
each iteration. Notice that, at a given S^/Nq, the shaped system 
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Fig. 9. Average number of BICM-ID global iterations required for correcting 
all codeword errors. 

needs fewer iterations. While the per-iteration complexity 
of the shaped system is higher than that of the uniform 
system, the need for fewer iterations may result in a lower 
overall complexity when early halting techniques are used. 
For instance, at Eb/N^ = 5.4 dB, which is a typical operating 
point for the unshaped system, an average of 25.3 iterations 
are required in AWGN for the unshaped system with BICM- 
ID reception. At the same operating point, the shaped system 
with a DVB-S2 standard LDPC code and an optimized LDPC 
code only requires about 18 and 15 iterations to converge, 
respectively. 

The overall and per-iteration complexity of the two systems 
can be compared by relating the actual execution time of their 
implementations. The simulation results were produced with 
software implementations of the systems running on a PC 
computer with a single-core 3.40 GHz Pentium 4 processor 
and the Windows XP operating system. For each system, 
the simulation's execution time, total number of data bits 
simulated, and total number of decoder iterations executed 
were logged. From this information, the throughput of the 
implementation was quantified in units of bit-iterations per 
second (bps-iteration). For the uniform system with rate-3/5 
DVB-S2 standardized coding, the processing rate is 2036.5 
bps -iteration, while for the shaping system, the processing rate 
when using the DVB-S2 standard code is 1805.7 bps -iteration. 
The processing rate for optimized LDPC code with rate 2/3 
and 9/14 is 1698.3 and 1627.1 bps -iteration, respectively. 
While shaping reduces the per-iteration throughput, the overall 
throughput may be increased due to the need for fewer iter- 
ations. For instance, using the previously mentioned example 
operating point of E^/No = 5.4 dB over the AWGN channel, 
the uniform system has an average throughput of 78.1 bps 
while the shaped system's throughput using the DVB-S2 
standardized LDPC code is 100 bps and for the optimized 
code is around 108 bps. 

VI. Conclusion 

The combination of APSK modulation and LDPC coding is 
well suited for the constellation shaping technique promoted 
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in this paper. To design such a system, the probability po of 
the shaping code and the ring radii of the APSK modula- 
tion should first be optimized using the CM capacity. Next, 
the LDPC code should be optimized by using EXIT charts 
to select the variable-node degree distributions (assuming 
a constant check-node degree). The system will require an 
iterative receiver, which iteratively demodulates the APSK 
signal, decodes the shaping code, and decodes the LDPC 
code. When the po is appropriately selected and the LDPC 
code is optimized, the resulting system with iterative reception 
outperforms a standard DVB-S2 system (which does not use 
iterative demodulation and decoding) by over 1 dB in AWGN 
with 32-APSK at a rate of 3 bits/symbol. This gain is due 
to three factors: (1) using iterative reception, (2) shaping, 
and (3) optimization of the LDPC code. The relative gain 
of each of these three factors is 0.33 dB, 0.46 dB, and 0.34 
dB, respectively. This is competitive with the shaping gains 
previously found for convolutionally coded QAM 1 1 1 1 and 
turbo-coded PAM 1121 . While shaping increases the required 
complexity per iteration, the need for fewer iterations opens 
up the opportunity for reduced overall complexity. While we 
constrain the number of symbols per ring to match the numbers 
required DVB-S2, other configurations could be considered. 
However, we note that the proposed system will work best 
with a single shaping bit if the number of symbols on the 
outer ring is equal to the number of symbols in the inner 
rings, as is the case for the 32-APSK configuration specified 
by DVB-S2. 
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